Search CORE

10 research outputs found

Group Affect Prediction Using Multimodal Distributions

Author: Rawat Bhanu Pratap Singh
Shamsi Saqib
Wadhwa Manya
Publication venue
Publication date: 12/03/2018
Field of study

We describe our approach towards building an efficient predictive model to detect emotions for a group of people in an image. We have proposed that training a Convolutional Neural Network (CNN) model on the emotion heatmaps extracted from the image, outperforms a CNN model trained entirely on the raw images. The comparison of the models have been done on a recently published dataset of Emotion Recognition in the Wild (EmotiW) challenge, 2017. The proposed method achieved validation accuracy of 55.23% which is 2.44% above the baseline accuracy, provided by the EmotiW organizers.Comment: This research paper has been accepted at Workshop on Computer Vision for Active and Assisted Living, WACV 201

arXiv.org e-Print Archive

Crossref

Conversational Machine Comprehension: a Literature Review

Author: Gupta Somil
Rawat Bhanu Pratap Singh
Yu Hong
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2020
Field of study

Conversational Machine Comprehension (CMC), a research track in conversational AI, expects the machine to understand an open-domain natural language text and thereafter engage in a multi-turn conversation to answer questions related to the text. While most of the research in Machine Reading Comprehension (MRC) revolves around single-turn question answering (QA), multi-turn CMC has recently gained prominence, thanks to the advancement in natural language understanding via neural language models such as BERT and the introduction of large-scale conversational datasets such as CoQA and QuAC. The rise in interest has, however, led to a flurry of concurrent publications, each with a different yet structurally similar modeling approach and an inconsistent view of the surrounding literature. With the volume of model submissions to conversational datasets increasing every year, there exists a need to consolidate the scattered knowledge in this domain to streamline future research. This literature review attempts at providing a holistic overview of CMC with an emphasis on the common trends across recently published models, specifically in their approach to tackling conversational history. The review synthesizes a generic framework for CMC models while highlighting the differences in recent approaches and intends to serve as a compendium of CMC for future researchers.Comment: Accepted to COLING 202

arXiv.org e-Print Archive

Crossref

Relation Classification for Bleeding Events From Electronic Health Records Using Deep Learning Systems: An Empirical Study

Author: McManus David D.
Mitra Avijit
Rawat Bhanu Pratap., Singh
Yu Hong
Publication venue: eScholarship@UMassChan
Publication date: 02/07/2021
Field of study

BACKGROUND: Accurate detection of bleeding events from electronic health records (EHRs) is crucial for identifying and characterizing different common and serious medical problems. To extract such information from EHRs, it is essential to identify the relations between bleeding events and related clinical entities (eg, bleeding anatomic sites and lab tests). With the advent of natural language processing (NLP) and deep learning (DL)-based techniques, many studies have focused on their applicability for various clinical applications. However, no prior work has utilized DL to extract relations between bleeding events and relevant entities. OBJECTIVE: In this study, we aimed to evaluate multiple DL systems on a novel EHR data set for bleeding event-related relation classification. METHODS: We first expert annotated a new data set of 1046 deidentified EHR notes for bleeding events and their attributes. On this data set, we evaluated three state-of-the-art DL architectures for the bleeding event relation classification task, namely, convolutional neural network (CNN), attention-guided graph convolutional network (AGGCN), and Bidirectional Encoder Representations from Transformers (BERT). We used three BERT-based models, namely, BERT pretrained on biomedical data (BioBERT), BioBERT pretrained on clinical text (Bio+Clinical BERT), and BioBERT pretrained on EHR notes (EhrBERT). RESULTS: Our experiments showed that the BERT-based models significantly outperformed the CNN and AGGCN models. Specifically, BioBERT achieved a macro F1 score of 0.842, outperforming both the AGGCN (macro F1 score, 0.828) and CNN models (macro F1 score, 0.763) by 1.4% (P \u3c .001) and 7.9% (P \u3c .001), respectively. CONCLUSIONS: In this comprehensive study, we explored and compared different DL systems to classify relations between bleeding events and other medical concepts. On our corpus, BERT-based models outperformed other DL models for identifying the relations of bleeding-related entities. In addition to pretrained contextualized word representation, BERT-based models benefited from the use of target entity representation over traditional sequence representation

eScholarship@UMMS

Fine-Tuning Bidirectional Encoder Representations From Transformers (BERT)-Based Models on Large-Scale Electronic Health Record Notes: An Empirical Study

Author: Cai Pengshan
Jin Yonghao
Li Fei
Liu Weisong
Rawat Bhanu Pratap, Singh
Yu Hong
Publication venue: eScholarship@UMassChan
Publication date: 12/09/2019
Field of study

BACKGROUND: The bidirectional encoder representations from transformers (BERT) model has achieved great success in many natural language processing (NLP) tasks, such as named entity recognition and question answering. However, little prior work has explored this model to be used for an important task in the biomedical and clinical domains, namely entity normalization. OBJECTIVE: We aim to investigate the effectiveness of BERT-based models for biomedical or clinical entity normalization. In addition, our second objective is to investigate whether the domains of training data influence the performances of BERT-based models as well as the degree of influence. METHODS: Our data was comprised of 1.5 million unlabeled electronic health record (EHR) notes. We first fine-tuned BioBERT on this large collection of unlabeled EHR notes. This generated our BERT-based model trained using 1.5 million electronic health record notes (EhrBERT). We then further fine-tuned EhrBERT, BioBERT, and BERT on three annotated corpora for biomedical and clinical entity normalization: the Medication, Indication, and Adverse Drug Events (MADE) 1.0 corpus, the National Center for Biotechnology Information (NCBI) disease corpus, and the Chemical-Disease Relations (CDR) corpus. We compared our models with two state-of-the-art normalization systems, namely MetaMap and disease name normalization (DNorm). RESULTS: EhrBERT achieved 40.95% F1 in the MADE 1.0 corpus for mapping named entities to the Medical Dictionary for Regulatory Activities and the Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT), which have about 380,000 terms. In this corpus, EhrBERT outperformed MetaMap by 2.36% in F1. For the NCBI disease corpus and CDR corpus, EhrBERT also outperformed DNorm by improving the F1 scores from 88.37% and 89.92% to 90.35% and 93.82%, respectively. Compared with BioBERT and BERT, EhrBERT outperformed them on the MADE 1.0 corpus and the CDR corpus. CONCLUSIONS: Our work shows that BERT-based models have achieved state-of-the-art performance for biomedical and clinical entity normalization. BERT-based models can be readily fine-tuned to normalize any kind of named entities

eScholarship@UMMS

Entity-Enriched Neural Models for Clinical Question Answering

Author: Linguist Assoc Computat
Min So Yeon
Raghavan Preethi
Rawat Bhanu Pratap Singh
Szolovits Peter
Weng Wei-Hung
Publication venue
Publication date: 26/01/2021
Field of study

We explore state-of-the-art neural models for question answering on electronic medical records and improve their ability to generalize better on previously unseen (paraphrased) questions at test time. We enable this by learning to predict logical forms as an auxiliary task along with the main task of answer span detection. The predicted logical forms also serve as a rationale for the answer. Further, we also incorporate medical entity information in these models via the ERNIE architecture. We train our models on the large-scale emrQA dataset and observe that our multi-task entity-enriched models generalize to paraphrased questions ~5% better than the baseline BERT model

arXiv.org e-Print Archive

DSpace@MIT

Recommended from our members

Evidence Assisted Learning for Clinical Decision Support Systems

Author: Rawat Bhanu Pratap Singh
Publication venue: ScholarWorks@UMass Amherst
Publication date: 04/08/2023
Field of study

Clinical decision support systems (CDSS) provide intelligently filtered knowledge and patient-specific and population information to the clinicians, nursing staff and healthcare professionals. CDSS can significantly improve the quality, safety, efficiency and effectiveness of health care. Over the last decade, American hospitals have adopted electronic health records (EHRs) widely resulting in a massive collection of clinical notes such as admission notes, physician notes, nursing notes and discharge summaries. For the past couple of decades, most of the work in CDSS has been focused on developing knowledge-based systems using structured data such as medications and ICD codes. In contrast, the EHR notes incorporate rich and important information including adverse drug events, suicidal behaviors, and social determinants of health, all of which are substantially under-represented in the structured data. This presents a unique opportunity for natural language processing (NLP), with its ability to process a massive amount of EHR notes beyond the scope of human capability, to provide new clinical evidence previously missed out by any CDSS systems. We contribute to the NLP and clinical community by developing a robust multi-task learning framework for CDSS. First, we identified causality between medication and its adverse drug reactions using a clinically standardized assessment technique called Naranjo Scale. Our multi-task learning framework takes a question from Naranjo Scale, along with a patient\u27s note to identify relevant evidence sentences and paragraphs in the note and predicts the final answer for the question. Second, we extracted suicide attempt (SA) and suicide ideation (SI) events from patients\u27 clinical notes. We created the first publicly available suicide attempt and ideation events (ScAN) dataset. We then built a multi-task learning model ScANER (Suicide Attempt and Ideation Attempts Retriever) to extract the relevant suicidal behavior evidence from clinical notes. Next, we deployed multiple parameter-efficient transfer learning techniques to fine-tune the ScANER model for different hospitals’ EHR datasets. By fine-tuning less than ~2% of ScANER’s parameters on a small annotated data, ScANER is able to maintain a similar performance. To provide evidence for population-level CDSS for suicide prevention, we identified risk factors for suicidal behaviors using large EHRs (~7 million patients). We found that patients with traumatic brain injury and/or post-traumatic stress disorder are more than twice as likely to have suicidal behavior as compared to the control population. We also studied the prevalence of other risk factors, such as social determinants of health, extracted from EHR notes using different NLP approaches

ScholarWorks@UMass Amherst

Inferring ADR causality by predicting the Naranjo Score from Clinical Notes

Author: Jagannatha Abhyuday
Liu Feifan
Singh Rawat Bhanu Pratap
Yu Hong
Publication venue: eScholarship@UMassChan
Publication date: 25/01/2021
Field of study

Clinical judgment studies are an integral part of drug safety surveillance and pharmacovigilance frameworks. They help quantify the causal relationship between medication and its adverse drug reactions (ADRs). To conduct such studies, physicians need to review patients\u27 charts manually to answer Naranjo questionnaire(1). In this paper, we propose a methodology to automatically infer causal relations from patients\u27 discharge summaries by combining the capabilities of deep learning and statistical learning models. We use Bidirectional Encoder Representations from Transformers (BERT)(2) to extract relevant paragraphs for each Naranjo question and then use a statistical learning model such as logistic regression to predict the Naranjo score and the causal relation between the medication and an ADR. Our methodology achieves a macro-averaged f1-score of 0.50 and weighted f1-score of 0.63

eScholarship@UMMS

Recommended from our members

Relation Classification for Bleeding Events From Electronic Health Records Using Deep Learning Systems: An Empirical Study

Author: McManus David D.
Mitra Avijit
Pratap Singh Rawat Bhanu
Yu Hong
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2021
Field of study

Background: Accurate detection of bleeding events from electronic health records (EHRs) is crucial for identifying and characterizing different common and serious medical problems. To extract such information from EHRs, it is essential to identify the relations between bleeding events and related clinical entities (eg, bleeding anatomic sites and lab tests). With the advent of natural language processing (NLP) and deep learning (DL)-based techniques, many studies have focused on their applicability for various clinical applications. However, no prior work has utilized DL to extract relations between bleeding events and relevant entities. Objective: In this study, we aimed to evaluate multiple DL systems on a novel EHR data set for bleeding event-related relation classification. Methods: We first expert annotated a new data set of 1046 deidentified EHR notes for bleeding events and their attributes. On this data set, we evaluated three state-of-the-art DL architectures for the bleeding event relation classification task, namely, convolutional neural network (CNN), attention-guided graph convolutional network (AGGCN), and Bidirectional Encoder Representations from Transformers (BERT). We used three BERT-based models, namely, BERT pretrained on biomedical data (BioBERT), BioBERT pretrained on clinical text (Bio+Clinical BERT), and BioBERT pretrained on EHR notes (EhrBERT). Results: Our experiments showed that the BERT-based models significantly outperformed the CNN and AGGCN models. Specifically, BioBERT achieved a macro F1 score of 0.842, outperforming both the AGGCN (macro F1 score, 0.828) and CNN models (macro F1 score, 0.763) by 1.4% (P\u3c.001) and 7.9% (P\u3c.001), respectively. Conclusions: In this comprehensive study, we explored and compared different DL systems to classify relations between bleeding events and other medical concepts. On our corpus, BERT-based models outperformed other DL models for identifying the relations of bleeding-related entities. In addition to pretrained contextualized word representation, BERT-based models benefited from the use of target entity representation over traditional sequence representatio

ScholarWorks@UMass Amherst

Directory of Open Access Journals

Entity-Enriched Neural Models for Clinical Question Answering

Author: Linguist Assoc Computat
Min So Yeon
Raghavan Preethi
Rawat Bhanu Pratap Singh
Szolovits Peter
Weng Wei-Hung
Publication venue
Publication date: 26/01/2021
Field of study

DSpace@MIT